Scene and Object Recognition with Supervised Nonlinear Neighborhood Embedding
Authors
Abstract
Image category recognition is important for accessing visual information at the level of objects and scene types. In this paper, we develop a Supervised Nonlinear Neighborhood Embedding (SNNE) subspace algorithm that operates on different visual features for object and scene recognition and learns an adaptive nonlinear subspace by preserving the neighborhood structure of the visual feature space. The algorithm combines nonlinear kernel mapping with preservation of the samples' neighborhood structure, so it can both closely approximate the nonlinear image manifold and enhance within-class neighborhood information. As a result, SNNE maps the ensemble of visual features into a more discriminative space for category recognition and, at the same time, can effectively combine several visual features to improve the recognition rate. The proposed method is evaluated on a scene database (SIMPLIcity) and an object recognition database (Caltech). The results confirm that the proposed method, using only simple visual features, performs much better than state-of-the-art methods.
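The abstract does not state the SNNE objective, so the following is only a rough sketch of the general idea it describes: a kernel mapping combined with a within-class neighborhood graph. What it implements is a supervised kernel locality preserving projection, which is a related, well-known technique and not the authors' algorithm. The function names, the RBF kernel, and the parameters `k`, `gamma`, and `reg` are all assumptions made for illustration.

```python
import numpy as np


def rbf_kernel(X, gamma):
    """Return the Gaussian (RBF) kernel matrix and squared distances for rows of X."""
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (X @ X.T), 0.0)
    return np.exp(-gamma * d2), d2


def within_class_knn_graph(d2, y, k):
    """Symmetric 0/1 adjacency linking each sample to its k nearest same-class samples."""
    n = len(y)
    W = np.zeros((n, n))
    for i in range(n):
        same = np.flatnonzero((y == y[i]) & (np.arange(n) != i))
        nearest = same[np.argsort(d2[i, same])[:k]]
        W[i, nearest] = 1.0
    return np.maximum(W, W.T)


def supervised_kernel_embedding(X, y, n_components=2, k=5, gamma=0.1, reg=1e-6):
    """Supervised kernel LPP sketch (assumed stand-in, not the paper's SNNE).

    Solves (K L K) a = lam (K D K + reg I) a and keeps the eigenvectors with
    the smallest eigenvalues, so same-class neighbors stay close after mapping.
    """
    K, d2 = rbf_kernel(X, gamma)
    W = within_class_knn_graph(d2, y, k)
    D = np.diag(W.sum(axis=1))
    L = D - W                                  # graph Laplacian
    A = K @ L @ K
    B = K @ D @ K + reg * np.eye(len(y))       # regularized to be positive definite
    # Reduce the generalized problem to a standard one via Cholesky whitening.
    C = np.linalg.cholesky(B)                  # B = C C^T
    C_inv = np.linalg.inv(C)
    M = C_inv @ A @ C_inv.T
    M = (M + M.T) / 2.0                        # remove round-off asymmetry
    _, evecs = np.linalg.eigh(M)               # eigenvalues in ascending order
    alphas = C_inv.T @ evecs[:, :n_components]
    return K @ alphas, W                       # embedded training samples, graph


# Toy data (made up for the demo): two Gaussian blobs in 5 dimensions.
rng = np.random.default_rng(0)
X_demo = np.vstack([rng.normal(0.0, 1.0, (20, 5)), rng.normal(3.0, 1.0, (20, 5))])
y_demo = np.array([0] * 20 + [1] * 20)
Y_demo, W_demo = supervised_kernel_embedding(X_demo, y_demo, n_components=2, k=5)
```

Restricting the graph to same-class neighbors is what makes the embedding supervised in this sketch. Several visual features could be combined under the same scheme by, for example, summing one kernel matrix per feature, although the abstract does not say how SNNE combines them.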
Similar resources
Orthogonal Tensor Sparse Neighborhood Preserving Embedding for Two-dimensional Image
Orthogonal Tensor Neighborhood Preserving Embedding (OTNPE) is an efficient dimensionality reduction algorithm for two-dimensional images. However, it still suffers from insufficient robustness and a lack of supervised discriminant information. Motivated by sparse learning, an algorithm called Orthogonal Tensor Sparse Neighborhood Preserving Embedding (OTSNPE) for two-dimensional im...
Fisher Discriminant Analysis (FDA), a supervised feature reduction method in seismic object detection
Automatic processing of seismic data using pattern recognition is one of the interesting fields in geophysical data interpretation. One part of it is seismic object detection using different supervised classification methods, whose final output is a probability cube. The object detection process starts with generating a pickset of two classes labeled as object and non-object and then selecting ...
Generalized Multi-view Embedding for Visual Recognition and Cross-modal Retrieval
In this paper, the problem of multi-view embedding from different visual cues and modalities is considered. We propose a unified solution for subspace learning methods using the Rayleigh quotient, which is extensible to multiple views, supervised learning, and nonlinear embeddings. Numerous methods including canonical correlation analysis, partial least squares regression, and linear discrimina...
Image processing is the process of analysis and manipulation of a digitized image in order to improve its quality. Two principles of image processing are improvement of pictorial information and processing of scene data. Recognizing the semantic category of complex
Scene recognition provides visual information at the level of objects and the relationships between them. The main objective of scene recognition is to reduce the semantic gap between human beings and computers in scene understanding. For example, it recognizes the context of an input image and categorizes it into scenes (forest, seashore, building, etc.). Some of the applications of scene recognition ar...
Kernel generalized neighbor discriminant embedding for SAR automatic target recognition
In this paper, we propose a new supervised feature extraction algorithm for synthetic aperture radar automatic target recognition (SAR ATR), called generalized neighbor discriminant embedding (GNDE). Based on manifold learning, GNDE integrates class and neighborhood information to enhance the discriminative power of the extracted features. In addition, the kernelized counterpart of this algorithm is also pro...